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Bayesian procedures designed to quantify truncation errors in perturbative calculations of quan¬ 
tum chromodynamics observables are adapted to expansions in effective field theory (EFT). In the 
Bayesian approach, such truncation errors are derived from degree-of-belief (DOB) intervals for 
EFT predictions. Computation of these intervals requires specification of prior probability distribu¬ 
tions (“priors”) for the expansion coefficients. By encoding expectations about the naturalness of 
these coefficients, this framework provides a statistical interpretation of the standard EFT proce¬ 
dure where truncation errors are estimated using the order-by-order convergence of the expansion. 

It also permits exploration of the ways in which such error bars are, and are not, sensitive to 
assumptions about EFT-coefficient naturalness. We first demonstrate the calculation of Bayesian 
probability distributions for the EFT truncation error in some representative examples, and then 
focus on the application of chiral EFT to neutron-proton scattering. Epelbaum, Krebs, and MeiBner 
recently articulated explicit rules for estimating truncation errors in such EFT calculations of few- 
nucleon-system properties. We find that their basic procedure emerges generically from one class of 
naturalness priors considered, and that all such priors result in consistent quantitative predictions 
for 68% DOB intervals. We then explore several methods by which the convergence properties of the 
EFT for a set of observables may be used to check the statistical consistency of the EFT expansion 
parameter. 

PACS numbers: 02.50.-r, ll.10.Ef, 21.45.-v, 21.60.-n 


I. INTRODUCTION 

Effective field theories (EFTs) describe the physics of 
systems with a separation of scale^] A key element in 
any EFT is a power counting that organizes calculations 
into an expansion in a dimensionless parameter or pa¬ 
rameters, which are typically formed from ratios of the 
low-energy and high-energy scales in the system under 
consideration. We denote this parameter generically as 
Q. In the simplest case Q = p/A 5 is the ratio of the 
typical momentum, p, of the process of interest to the 
break-down scale, A&, of the EFT—i.e., the scale at which 
the first dynamics not explicitly included in the EFT ap¬ 
pears. Even in more complex situations with many low- 
energy scales, the EFT expansion for X can be denoted: 

00 

X = X 0 J2c n Q n , ( 1 ) 

n=0 

where Xo is the natural size of the observable X, and 
{c n } are dimensionless coefficients, some of which may 
be zero. In most EFTs the expansion 0 is inherited di¬ 
rectly from the EFT Lagrangian or potential—with suit¬ 
able additions (e.g., terms of the form Q n log(Q)) due to 
quantum corrections. In nuclear physics, however, the 
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dynamics is intrinsically non-perturbative, and there ex¬ 
ists at least some sub-class of EFT graphs that must 
be summed to all orders. The connection between the 
Lagrangian and observables is then less direct. Never¬ 
theless, a properly formulated EFT for nuclear physics 
is expected to have a Q-expansion for observables of the 
form 0 and it is the properties of such expansions which 
are our concern in this paper. 

A key benefit of the perturbative series 0 is that it 
permits estimation of the error induced by truncation at 
a finite order k : “truncation errors”. If the coefficients 
c n for an observable were to vary significantly and unsys¬ 
tematically in size, the expansion 0 would be unsuited 
to this end. However, experience, and the principle of 
naturalness, suggest that the coefficients are typically of 
order one—even in the more complex nuclear context. 

In Ref. [4] we laid out a recipe for uncertainty quantifi¬ 
cation in EFTs for nuclear physics. While they are not 
the only source of theory uncertainty, truncation errors 
are often the dominant uncertainty in EFT calculations. 
We argued that Bayesian methods [5] provide an error 
bar, with a well-founded statistical interpretation, that 
accounts for all sources of uncertainty in the EFT. In 
particular, Bayesian methods are essential to the assess¬ 
ment of truncation error: assumptions (or expectations) 
about the EFT are encoded in “prior probability distri¬ 
bution functions” (pdfs). The Bayesian approach then 
proceeds by integrating out (“marginalizing”) the coeffi¬ 
cients of omitted terms to establish the truncation error. 

The use of priors is often controversial because they can 
introduce subjective judgments about, e.g., the meaning 
of naturalness, into the computation. We argue that, on 
the contrary, introducing and stating Bayesian priors on 
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higher-order EFT coefficients renders explicit in the cal¬ 
culation assumptions that are present but typically not 
articulated. This allows such assumptions to be applied 
consistently, tested, and modified in light of new infor¬ 
mation. 

In this work we begin with the general formalism for 
computation of truncation errors discussed in the context 
of perturbative QCD (pQCD) by Cacciari and Houdeau 
in Ref. [6 and further developed in Refs. im where 
it is called CH (cf. Ref. j9] for a brief summary). We 
rederive, adapt, and extend their prescription to EFT ex¬ 
pansions. We explore several different choices of prior for 
the coefficients {c n } and examine—within some generic 
examples—how such prior choice affects the truncation- 
error estimate. We then look specifically at the nuclear 
context, focusing on the extent to which such calcula¬ 
tions justify the uncertainty quantification (UQ) proce¬ 
dure typically adopted in EFTs. This procedure has re¬ 
cently been clearly stated and applied to nucleon-nucleon 
(NN) cross sections by Epelbaum, Krebs, and MeiBner 
(henceforth EKM) in their fourth- and fifth-order appli¬ 
cations of chiral EFT to these observables mm- (j n- 
troductions to chiral EFT can be found in Refs. H2HHI-) 
Note that here we do not deal with the extent to which 
truncation errors affect the low-energy constants (LECs) 
extracted from fitting EFT expansions to data. This will 
be discussed in a separate publication PI- Our focus 
here is solely on estimating the truncation error at order 
k for the series 0, given the assumption of naturalness, 
and information on the size of the coefficients Co,..., c&. 

In Section [IIJ we provide a brief overview of the 
Bayesian rules we will need and then adapt the CH pre¬ 
scription so that it is suitable for application to EFT 
expansions. We also enlarge the set of priors considered 
by CH. In Section m this approach is applied to some 
two-body observables considered by EKM, using their 
assumed breakdown scale to compare to their error as¬ 
sessment. In Section IV we explore methods to determine 
the breakdown scale from the requirement that the EFT 
coefficients be consistent with naturalness. In Section El 
we summarize our results. 


II. ADAPTING CH TO EFT 

A. Setting up the problem 

Consider the perturbative series 0 for the observable 
X. If the series is truncated at order k then the error 
induced is XoA^, where the scaled, dimensionless pa¬ 
rameter that determines the truncation error is: 

oo 

’ ( 2 ) 

n=k -\-1 

provided the series actually converges and is not solely 
asymptotic. For sufficiently small values of Q, the first 
omitted term c/ c+ iQ /c+1 is a good estimate for A&. This 


leads to simplified formulas for the evaluation of DOB 
intervals. Below we will consider both this first-term ap¬ 
proximation and evaluations at larger Q that use several 
terms in A&. In either case this provides an estimate of 
the deviation of the series at order k from the true value 
of the observable—even if the series is asymptotic. 

In Ref. [6] Cacciari and Houdeau (hereafter “CH”) con¬ 
sidered the case that the series ([l]) is a pQCD expansion. 
The expansion is then in powers of the strong coupling, 
a s (/i), where g is a renormalization scale chosen appro¬ 
priately for the observable X. The optimal choice of fi 
is the subject of much debate and many prescriptions in 
the literature. Indeed, the variation of the truncated-at- 
order -k expression for X under an order-unity change in 
q is canonically used to estimate A&. This is justified be¬ 
cause of the truncated expansion’s residual dependence 
on the renormalization scale /q the full sum should be 
independent of /q so the variation with q contains infor¬ 
mation about omitted terms. 

CH pointed out that varying the scale q around an op¬ 
timal value q o, say, between qo/2 and 2qo, does not yield 
an uncertainty with a straightforward statistical interpre¬ 
tation. They therefore laid out a Bayesian probability- 
theory calculation of A&. Ultimately estimates from scale 
variation seem to coincide quite well with the results of 
this more rigorous probabilistic analysis. As we now de¬ 
scribe, that analysis starts with priors for {c n } that en¬ 
code the assumption that these pQCD coefficients are of 
order unity (once the typical size of the observable X is 
factored out of the expression). It modifies them in light 
of information acquired as more orders in the series for 
X are computed, and ultimately obtains a posterior pdf 
for A*;. With this posterior in hand, either the degree of 
belief (DOB) corresponding to a given interval of values 
of the truncation error, or the range of truncation errors 
corresponding to a specified DOB, can be computed. 

CH’s analysis of truncation errors in pQCD took the 
case where the series is in powers of a s , rather than Q. 
Later work 13 El introduced a scale factor A, such that 
the expansion was in powers of a s /X (e.g., because the 
expansion parameter might really be a s /7r or include a 
color factor) and a possible combinatoric factor (such as 
nl from high-order renormalon contributions). In this, 
termed the “CH prescription”, the assumption is modi¬ 
fied to say that the coefficients in the perturbative series 
of an appropriately chosen expansion parameter are dis¬ 
tributed such that they share a common upper bound. In 
EFT the rescaling by a factor A corresponds to the choice 
of a different breakdown scale for the EFT, and we dis¬ 
cuss this possibility in Section |IV| We do not consider 
the combinatoric factor, which has not been identified in 
EFT expansions for few-nucleon-system observables. 

Since one of the low-momentum scales of which Q is 
formed is the momentum at which the observable X is 
measured, the EFT expansion parameter is strongly de¬ 
pendent on kinematics. While the QCD expansion’s con¬ 
vergence can be improved by matching the scale at which 
a s is evaluated to that of the observable of interest, in 
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EFT the dependence of Q on momentum is intrinsic—not 
a matter of choice. Furthermore, the high-momentum 
scale, Afr, that specifies the radius of convergence of the 
EFT momentum expansion, may not be known a priori , 
it may only be able to be inferred from the behavior of 
the EFT series. This is a key difference between EFT 
and pQCD, since in pQCD, the value of a s can always 
be specified. In EFT a value of Q corresponding to a par¬ 
ticular momentum must be chosen, and then checked for 
consistency. Complicating the choice of an appropriate 
Afr—and concomitantly the evaluation of Q —for many 
low-energy EFTs is that at least some of the c n ’s need 
to be extracted from data, either from X itself or from 
other observables. 

These differences from the pQCD situation are re¬ 
flected in the need to check the naturalness of EFT coef¬ 
ficients for a given choice of expansion parameter, some¬ 
thing that we discuss in detail in Section IV For the 
time being we assume that A& has been determined as 
part of the steps in the EFT analysis that yielded the 
coefficients {c n : n = 0,..., k}. Empirically EKM found 
that A 5 « 600 MeV resulted in natural coefficients in 
their EFT series for neutron-proton scattering cross sec¬ 
tions uni eu. However, the non-perturbative nature 
of NN scattering makes it unclear how naturalness for 
EFT LECs results in these apparently natural values of 
{c n : n = 0,..., k} in the cross-section’s expansion. This 
connection is very clear for perturbative EFT expansions 
of interest in nuclear physics, e.g., the chiral expansion 
for the nucleon mass (see Ref. [5]) and the expansion 
for energy per particle of a dilute Fermi system with 
natural-sized scattering length [T9] . Regardless though, 
in either perturbative or non-perturbative cases, an in¬ 
correct choice of high-momentum scale, A*,, will result 
in coefficients that are not natural. This emphasizes the 
close connection between the assumption of a particular 
expansion parameter and the imposition of a naturalness 
prior. 


B. Conditional probabilities: definitions and rules 

We use the notation pr(x| I) to denote the probability 
(density) of x given information /; thus pr(Afc|co,..., c&) 
is the desired pdf for A&. The specification Co,..., sug¬ 
gests that Co is non-zero, but it is straightforward to gen¬ 
eralize the results derived here to the case where the first 
non-zero coefficient is c\ with l > 0 (as often is the case 
in QCD) or that where some intermediate coefficients are 
identically zero (as for chiral EFT in NN scattering where 
n—\ does not appear). 

Because the terminology, techniques, and manipula¬ 
tions of Bayesian statistics may be unfamiliar to our in¬ 
tended audience, we include a brief overview here of those 
aspects needed for the CH procedures mum- We in¬ 
dicate parenthetically some analogies to familiar manip¬ 
ulations in quantum mechanics. We emphasize that the 
correspondences are not to be taken literally. 


Bayesian probabilistic inference is built on the sum and 
product rules. If the set {xi} is exhaustive and exclusive 
(cf. complete and orthogonal), then the sum rule says 
that pi(xi\I) is normalized, 

F P r ( x *l-0 = 1 —* J dxpv(x\I) = 1 , (3) 

where the continuum version is integrated over the appro¬ 
priate range of x. But it further implies marginalization 
(cf. inserting a complete set of orthonormal basis states): 

W(x\l) = Yiw(x,Uj\I) , (4) 

j 


or the continuum version 

W(x\l) = Jdypr(x,y\I) = 1 , (5) 

where pr(x, y\I) is the joint probability of x and y given 
I. We will apply this repeatedly, either to introduce new 
parameters or to integrate out “nuisance” parameters. 

Expressing pr(x| I) in terms of the joint probability 
pr(x,y\I) through Eq. © leads to progress by applying 
the product rule to relate it to other pdfs: 


pr(x, y\I ) = pr(z| y, I) pr(y\I) = pr(y\x, I) pv(x\I) . (6) 


The first equality translates to: “the joint probability of 
x and y is equal to the probability of x given y and / 
times the probability of y given The second equal¬ 
ity follows by symmetry, but when rearranged becomes 
Bayes’ theorem: 


prO| y,I) 


prjyjxj) pr(x| I) 

pr(y|/) 


( 7 ) 


which here relates the posterior pv(x\y,I) to the likeli¬ 
hood pr(y|x,J) given the prior pr(x| I) and the evidence 
pi(y\I). These relations will enable us to derive the pos¬ 
terior for A*, in terms of assumed priors. 

Another implication of the product rule follows when 
x and y are mutually independent, which means that 
knowing y doesn’t affect the probability of r, so that 
pv(x\y,I) = pv(x \/). Then Eq. © tells us that 

pr(z, y\I ) —» pr(x\I) pv(y\I) . (8) 


In the following we sometimes omit the explicit /, but 
the specification of prior information should always be 
assumed. 


C. CH synopsis and EFT correspondence 

We consider multiple priors that reflect the expecta¬ 
tion that all coefficients in the expansion of an observ¬ 
able in powers of Q are of roughly the same size—or, 
more precisely, they have a distribution with a character¬ 
istic size. The fundamental assumption made by Cacciari 
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TABLE I. Prior pdfs. 
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and Houdeau in Ref. [ 6 ] is that all coefficients of in 
the pQCD series are roughly the same size, which is im¬ 
plemented by treating them as random variables having 
a shared distribution with upper bound c. This assump¬ 
tion is motivated by empirical evidence from the behav¬ 
ior of such series. But it may not be correct for EFT 
expansions, where the form 0 is expected to result in 
coefficients which are 0 ( 1 ), not arbitrarily large. 

To proceed we need to translate such a fundamental 
assumption into concrete expressions for priors. Cacciari 
and Houdeau do this through three supplementary as¬ 
sumptions (which they call “hypotheses”) [ 6 ], as follows. 

• The prior probability densities for coefficients at 
different orders are independent in the sense of (| 8 | . 
I.e., given an upper bound c, the joint prior density 
for coefficients factorizes: 

n 

pr(c 0 ,...,c„|c) = JJpr(ci|c) . (9) 

i=0 

CH then also assume that pr(Q|c) is the same pdf 
for each i. Thus the value of c is the most knowl¬ 
edge obtainable from the known coefficients when 
predicting possible values of unknown ones. In this 
way, we have isolated communication from the data 
about the sum of all omitted higher-order terms 
into one variable c. 

• Next we need a specific prior probability distribu¬ 
tion for pr(c^|c). In the interest of understand¬ 
ing the prior-dependence of our analysis, we test 
alternative implementations of naturalness in the 
priors. The extent to which the posterior pdf 
for A/e is stable under different, but reasonable, 
choices of prior indicates the extent to which data 
on {c n : n = 0 ,..., k} is sufficiently informative to 
dominate the analysis. 

When we know there is an upper bound to the co¬ 
efficients, an application of maximum entropy dic¬ 
tates that the least-informative distribution is uni¬ 
form for \ci\ < c. Such uniformity is additionally 
appealing because it can lead to simple, analytic 
results. This uniform prior is the initial choice of 
Ref. [6 . We employ it in priors we denote as “Set 
A” and “Set B” (see Table [I]), the difference being 
the prior pdf assumed for c in the two cases (see 
below). 

The priors of “Set C” in Table [i] then correspond 
to the ensemble naturalness assumption of Ref. [5]. 


This Gaussian prior follows from the maximum- 
entropy principle assuming knowledge of testable 
information on the mean and standard deviation of 
the c n ’s [ 20 j| 22 ] 

(Ed)=(fc + l)c 2 , ( c n) = 0 • (10) 

\n=0 / 

We will see below that analyses with Sets A and 
B are insensitive to details of the distribution of 
{co,... ,c/c}: the only feature of the distribution 
that matters is the value of the largest of these 
k + 1 lower-order coefficients. On the other hand, 
results under Set C priors are affected by the spe¬ 
cific distribution of these coefficients, as well as by 
the largest value. 

• Finally, the application of Bayes’ theorem requires 
a prior for c: pr(c). Uniformity of Inc is the only 
way to ensure unbiased expectations regarding the 
scale of c [23]. This log-uniform prior for c was cho¬ 
sen in Ref. [ 6 ], and so Set A of Tablets their choice 
of prior. We also employ the log-uniform prior for 
c in Set C, there following Schindler and Phillips in 
Ref. [5.1. Such a prior cannot be normalized for c 
in (—oo, oo) and is therefore termed an “improper 
prior”. Limiting the range of c through the use of 0 
functions permits an examination of the otherwise 
ill-defined limiting behavior. CH chose c< = e and 
c> = 1 /e and take the limit e —» 0 at the end of the 
calculations. Thus, the 6 functions and associated 
lnc>/c< factor serve to regulate the distribution so 
that the pdf is always normalized. Taking the limit 
e —)> 0 expresses complete ignorance of the scale of 
c, although we will also consider finite ranges for 
the marginalization over c, thereby rendering the 
priors more informative. 

In Refs. 0 0, a more informative c prior is con¬ 
sidered based on the fact that the first coefficient 
co can be scaled out. The authors argue that in 
this case it is no longer necessary to allow for an 
arbitrarily large value for the other coefficients. In 
consequence they assume c’s prior is a log-normal 
distribution about zero. We take this as Set B of 
Table [T] Note that scaling the observable so that 
the first coefficient is order unity is also what we 
do for the EFT expansion, see Eq. ([!]). 

In the case of prior information on the naturalness 
of coefficients that is different than that discussed here, 
maximum entropy can be used to derive how the dif¬ 
ferent information should be reflected in alternative pri¬ 
ors [ 22 ]. Such direct conversion from information on the 
interpretation of naturalness to prior pdfs facilitates rig¬ 
orous derivation of the consequences of the concepts in 
question through the use of formal reasoning and the lan¬ 
guage of probability. We now show how this transpires 
by deriving the pdf for A&, initially refraining from spec¬ 
ifying anything about the priors on {cq, ..., c/-}. 
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D. Posteriors and DOB intervals for A & 

Given the three assumptions described above and the 
prior sets of Table [l| we can systematically derive the 
posterior for by repeated application of the sum and 
product rules and their logical Bayesian consequences. 
At each step, we introduce a specific concept being built 


into the analysis. For this general derivation we assume 
that the coefficients start from k = 0 and are all sig¬ 
nificant and non-zero (later we will modify the general 
results to treat the case of NN scattering in chiral EFT, 
where the orders are nominally Q°, Q 2 , Q 3 , .. .with Q 1 
absent). 


1. Formula for pr(Ak|c 0 ,..., Ck): We seek pr(A/c|co,..., c/~), which is the probability density for the dimensionless 
residual, A&, given the known values of the first k -hi coefficients. Because the true value of A& depends only 
(explicitly) on the unknown coefficients c n> /e, we insert them into the equation by integrating over all their 
possible values using <© and 


pr(A fc |c 0 , • • •, Ck) = j pr(Afe|cfe+i, Cfe+2,...) pr(cfc + i, c k +2, ... |c 0 ,..., c k ) dck+\ dc k+2 


/ 


<5 A fc - ]T c n Q r 


n=k+1 


P r (Oc+l, Oc+2, • • • |C 0 , • • • , Ck) dCk+l dCk+2 


where we have used 


pr(A fe |c 0 ,... ,c k ,c k +i,c k +2, ■ ■ •) = p r (A fc |cA; + i,c fe+ 2,...) = S I A fc - ^ c n Q n J . 

V n=k -\-1 / 


(ii) 


( 12 ) 


The latter is a direct consequence of Eq. ©• Equation © states that, to get a specified A^ given a set of 
known coefficients, we need to sum up all the different combinations of c n ’s with n > k that give us this A&, 
weighting each combination by its probability given the known values of coefficients c n for n < k. Note that all 
of these integrals over c n are from — oc to +oo in general, but in particular cases there may be constraints (e.g., 
a cross section is positive, so the leading coefficient will be positive). 

The probability density © is correctly normalized since the normalization integral over A& can be performed 
using the delta function, leaving the normalization integral for pr(c£+i, Cfc+ 2 ? ... |cq, ..., c&), which is unity. 


2. Independent priors: Our priors are based on the assumption that c is the only information that gets trans¬ 
mitted to the distribution of c n for n > k. Thus, at this stage we introduce c as an intermediary in the pdf in 
the integrand of Eq. © via another marginalization integral, and apply this assumption: 


pr(cfc+i, Cfc+2,... |c 0 ,..., Cfc) 


poo 

/ pr(c fc+ i, c k+ 2 ,... \c, c 0 ,..., c k ) pr(c|c 0 , ...,c k )dc 
Jo 

poo 

/ pr(cfc+i, c k+2 ,... |c) pr(c|co, ...,c k )dc 
Jo 

poo 00 

/ [ f[ pr(c„|c) pr(c|c 0 ,..., c k ) dc . 

JO ^ — I 1 


n=k+l 


(13) 


In the final equality we have used the assumption that the C{ distributions are independent, causing the joint 
densities for the c n ’s with n > k to become the product of independent densities pr(c n |c). We will see that the 
imposition of Set A or Set C priors for c makes the limits on this integral finite. 


3. Leading-term approximation: We next assume A & ~ c/ c+ iQ /c+1 = A^ in Eq. (11) and return later to relax 
this assumption. By examining the effect of this assumption on DOB intervals, we will show that this approxi¬ 
mation is quite appropriate for small values of Q. This is the simplest way to exploit the delta function, which 
then depends only on Cfc+i. After substituting Eq. (13) into ©. the Cfc + 2 , • • • integrals are just normalization 
integrals (equal to one), leaving integrations over c^+i and c: 


/ OO poo 

/ [ <5 ( A fc 1) - 

-oo J 0 

= pr( °‘ +1 


Cfc+iQ fc+1 )]pr(cfc+i|c) pr(c|c 0 , ..., c k ) dcdc k+l 
= /Q k+1 \c)pv(c\co,...,c k )dc . 


(14) 
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4. Expanding the composite prior: The first pdf in the integrand of (14) may be directly evaluated for a 
given choice of prior, but the second cannot. It can, however, be identified as being constructed from the priors 
defined in Table [TJ via application of Bayes’ theorem: 


pr(c|c 0 ,...,c fc ) 


pr(co,...,c fe |c)pr(c) 
pr(c 0 ,... ,c fe ) 
pr(c 0 ,...,c fc |c)pr(c) 

Jo 00 pr(c 0 , • • •, c k \c') pr(c') dc' 


J|pr(c„|c) pr(c) 

n =0 


poo k 

/ pr(c')dc' 

n=0 


(15) 


In the second line we have introduced another marginalization over d in the denominator, while in the third 
line we apply the independence assumption of Eq. Combining ( [15] ) and © gives: 


pr(A«|c 0 ,...,c fc ) = 


poo k 

/ pr(c fc+ i = A { ^/Q k+1 \c) [JJ pr(c n |c) pr (c)dc 

^_ n =0_ 

poo k 

Q k+1 [FT pr(c„|c') pr(c')<fc' 

n =0 


(16) 


Now we can apply one of the sets of assumptions in Table [TJ which give us specific forms to evaluate each of 
the pdfs in Eq. (16). Note that if some of the q’s for i < k are identically zero, there are correspondingly fewer 
terms in the products of pr(c n |c) in Eq. (16). 

5. Prior Set A^: Prior Set A has been developed under the assumption that identifying a maximum value c is 
a valid concept. We here start with a finite c range between c< and c> for which Eq. (16) can be evaluated 
analytically. If c< = e and c> = 1/e and we take the limit as e 0, we designate this as Set A e . Meanwhile, 
the superscript W is introduced to denote the use of the first-term approximation. 


For this prior choice, A^ 1 ), the denominator of Eq. (16) is directly evaluated as there are only integrals over 
theta functions: 


poo » poo r n ' 1 

/ [n p r ( c «i g/ ) p r ( g/ ) d5 ' = / [n - i cn D 

1 i r s > 

2/c+1 lnc>/c< 7 max( 


1 


1 


l 


max(c (fc ),c<) 


qJ /c +2 


dc 


(17) 


where we have followed CH [6j and introduced the variable to denote the maximum of the first j T 1 
coefficients: 


c (j) = max(|c 0 |, 


,-!)• 


(18) 


The integration over c in the numerator of Eq. (16) is similar, but contains the extra pdf pr(c/e + i = A^/Q /c+1 |c) 
in the integrand. Using the theta functions to once again define the integration bounds, the numerator simplifies 
to 


1 1 

2^+2 lnc>/c< 


0 


( c > —c (fc+i)) [ 

J rr 


c> 

max(c (k + i),c<) 


1 

ojk~ I - 3 


dc . 


(19) 


We can see now how regulating the integrals with c> and c< will allow terms such as lnc>/c< (and most factors 
of 2) to cancel between the numerator and denominator, after which we may choose to take e to zero in A^ 
without consequence. 
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More generally, we assume that the integration range for c is wide enough that c< < C( k ) < c>. The posterior 
then evaluates to: 


P r (^fe 1) | c 0i ■■ ■ i c k) 


1 l/fc + l\ 
Q k+l 2 \k + 2j 


6 (c> — c (fc+ i)) 


( fe+ i) _ _(fe+1) 

c (fe) c > 


r=-(*+2) _ =-(*+2) 

c (fc) c > 



if |a«| < c (k) Q k+1 
if |a£ 1) | > c (k) Q k+1 


( 20 ) 


If some of the coefficients are zero (e.g., the series starts at Q l with / > 0, or one or more intermediate coefficients 
are zero) we can revise these formulas trivially: the only change is that there are fewer theta functions in the 
integrals. Taking n c to be the number of non-zero constants, we implement this generalization by replacing k 
by n c — 1 everywhere except for powers of Q, which remain k + 1. Thus, the modified posterior for Set A^ 1 ) is 


--(n c + l) (n c + l) 


pr(A^ 1) |co,...,c /c ) = 


1 1 


Q k+1 2\n c + l 


9 (c> c^ k _ i-i)) 

r~ n c _ ^-n c 

C (k) C > 



if |a£ 1) | < c {k) Q 


fc+i 


n c +1 


-(n c +1) 


if I a£ 1} I > c {k) Q k+1 


, ( 21 ) 


which simplifies to the corresponding equation of Ref. [6] in the limiting case of A, 


(i) 


p , (A< 1| |c,-..,e t ) = (^) 


if |a£ } | <c (k) Q k+1 


n c +1 


if |A 


k ) \>C(k)Q k+l 


( 22 ) 


Note that this simple generalization is possible due to the identical treatment of the priors for each coefficient. 

6. Prior Sets and C^ 1 ^: Neither Set B nor Set C priors allow for analytic integrals over c, so the discussion 
here will necessarily be less extensive than that for Set A. In the first-term approximation the posterior for A k 
can be reduced to a one-dimensional integral, whose evaluation must be left to numerical integration. Inserting 
Set C priors into Eq. (16) results in 


P r (Afc 1) | c 0, ■ ■ ■ , c k) = 


1 1 c> 


/ \ n c +2 

0) 

Ue-^ 2 

_n 

Q k+l P 

2(Q k + 1 ) 2 c 2 

i 

c< 

( -J- \ n c + l 

V5V 

Yl e -cl/2c' 2 

_ n 



(23) 


where products are assumed to run over all n c coefficients with defined prior distributions. 

7. DOB intervals: In the Bayesian framework, the posterior pr(A/e|co,..., c k ) contains the complete information 
we claim to have about the dimensionless residual A k . In some applications we need to use the entire posterior 
because it is very structured (e.g., multi-modal or simply non-gaussian), but here we can capture most of the 
information with the choice of a small number of degree-of-belief (DOB) intervalsj^] 

In particular, the DOB for a particular interval in A k is found simply by integrating pr(A/ c |co, ...,c k ) over this 
interval. We could also start with a given DOB, e.g., the standard frequentist (gaussian) 68% or 95%, and 
determine the smallest interval that integrates to that number. Or we could specify some other criterion for 
deciding the interval, such as that it is symmetric about the mode. In fact, the use of any of the priors in Table [T] 
results in a smallest p%-DOB interval for A k that is symmetric about the mode; following Ref. [6] we denote 
the corresponding dimensionless limits by d= . Thus the implicit definition of this interval is 

r d i P) 

p%= pr(A/e|co,..., c k ) dA k . (24) 

J -4 P) 


2 These are also called “credibility” or “credible” intervals. 

























In the limiting case of prior Set , this integral can be evaluated explicitly [5j: 


Uc + 1 


j(p) 77 f)k +1 v 

dk ^(fc) N 




,(n c + 1)(1 -p%) 


l/n c 


if p < 
if p > 


n c ~f 1 


( 25 ) 


where n c is again the number of non-zero known coefficients. Thus, with these priors, the interval of width 
d(/ c )Q /c+1 about the EFT prediction at order k is a n c /(n c + 1) * 100% DOB interval, cf. Ref. [ 6 ]. Such a 
theory error bar has often been assigned in previous EFT calculations, and—as we shall discuss further in 
Section [TTT] —corresponds to the prescription formalized in Refs. mm- It is important—e.g., in the context of 
error propagation—to keep in mind that this prior leads to a distribution of probability for the truncation error 
that is not Gaussian. 

For the more general form of prior Set A^ 1 ), an analytic formula for can still be found. The explicit form 

of the integral depends on the p% value of interest, because of the change in structure for |A^| > c^Q k+1 . 
Thus, we first calculate this transition value (p%) t by integrating the maximal probability within the first region 
in which |cfc+i| < to obtain 


(P%)t = 


-n c + l —n c -\-l 

C (/e) C > 

1 1 ~ 

C “(fc) ^> C 


Ur 


c (k) 


(26) 


Now suppose we are interested in p% intervals for p < p t . Equation (26) implies that the interval bounded by 
variation ±C(/ c )Q /c+1 is a (p%) t -DOB interval. Generally, the DOB interval for Set A^ 1 ) is bounded by 


d[ r, = 


C nc + 1 

C (fc) 


_L_ 1 _ 

—n r —n r 

c ( fe ) c > 


-l 


-P%Q 


k+i 


if p% < (p%) t 


(27) 


When one is interested in larger p% values, it may be beneficial to take advantage of the normalization of the 
pdf to conduct an integration in only one region by integrating the second case of Eq. (21) on the interval 


(p) 


k >' 


. Because c^+i) = c^+i in this region, the theta function truncates this integration at = c>Q /c+1 . 
The resulting implicit expression for if p% > {p%) t is thus 


(1 -P%) = 


Q k+1 


n c 


1 


n c + 1 J — -A- 

\k) C >° 


(4 P) -c>Q fe+1 ) (Qfc+l ) n = +1 

_L 1 T 


^c + 1 


(4 P) )" C {c > Q k+1 ) r 


(28) 


For Set B^ 1 ), Set CV\ or, indeed, for any of the sets if we do not make the first-term approximation, the DOB 
interval d^ can be found numerically from Eq. (24) by integrating pr(A/ c |co,..., C&) (e.g., from Eq. (23)) from 


zero until the integral equals p/2. We stress again the resulting DOB intervals are not standard deviations and 
make no statement about the shape of the normalized function which integrates to 0.68 between the bounds 
d zd 


( 68 ) 


8 . Relaxation of first-term approximation: 

we introduce the generalized notation 


To relax the assumption that the first omitted term dominates, 


^rriax 

Afe » A+ Y, c mQ m = A { k k ™ x ~ k) = A { k h) , (29) 

m=k-\- 2 

where k max is the highest-order coefficient kept in the sum of omitted terms. Returning to step 3 above, we 
continue to use the S function to eliminate the integral over Cfc+i, with the result that A^ is replaced by 
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A^ax k ) - m ^he subsequent expression and the integrations over c m for m > k + 1 up to m = & max remain. 
The generalization of Eq. (16) is then 


P r (A^ ) | co,...,c k ) = 


/ oo roo /»oo r ^max 

•••/ / P r ( c fc+i = Aj^ l) (Q)/Q fc+ 1 |c) JJ pr(c ra |c) pr(c)<ic JJ pr(c m |c)<fc 

-oo J-ooJo _ n=0_ m=fc+2 _ 

/•OO _ & 

Qfc+i / npr(c „| c - 0 pr (c / ) dc' 

n=0 


, ( 30 ) 


where there are fc max — k 1 integrals from -oo to oo in the numerator, in addition to the integral over the 
(positive) c. Note that if the first omitted term really does dominate, then the integrals over higher c m ’s are 
trivial normalization integrals, restoring the result of the first-omitted-term approximation. 


9. Summary: We have derived a general result for pr(A/ c |co,... ,c&) in Eq. (30), which is valid for any of the 
sets in Table [l| In most cases this expression must be evaluated numerically, for example by Monte Carlo 
integration. By assuming the first omitted term dominates, we obtain the much less involved integration in 
Eq. (16). Evaluating the application of this approximation to Set A^ 1 ) yields the analytic result in Eq. (21) 
while for Sets B^ 1 ) and CK 1 ) integrals are left to be evaluated numerically—see 
intervals can be derived from these posteriors analytically for A^ 1 ) (Eqs. (25), (27) 
for the others from Eq. (24). 


e.g., Eq . (|23| ). Finally, DOB 
and (28)) and numerically 


E. Representative examples 


Before applying the Bayesian framework developed by 
CH, and extended above, to the specific problem of NN 
scattering, we make some general observations on the 
form of the posteriors for A k and the systematics of the 
68 % and 95% DOB intervals for various prior sets from 
TableH 

We start with the set A e , defined by c< = e, c> = 1/e, 
with e —>• 0 (in practice all results here in which e is in¬ 
voked use e = 0.001). The posterior distribution for A^ 
in Eq. ( [2l| ), which assumes the first omitted term dom¬ 
inates the error, has a flat central plateau with power- 
suppressed tails. This is illustrated by the red curves in 
Fig. [l] for k = 0, k = 2, and k = 4, for Q = 0.33. The 
heavy and light shaded regions show the 68 % and 95% 
DOB intervals, respectively. From Eq. (25), the width 
of the posterior is given by c^ k )Q k+1 times a number of 
order unity, so the dominant effect is that the width de¬ 
creases by a factor of Q with each increase of k by one. 
In fact, only the maximum value of the c n ’s for a given fc, 
C(fc), matters under this choice of prior; the distribution 
of those c n ’s is irrelevant. The overall size of all DOB 
intervals then scales linearly with so here Co, ..., c k 
have all been set to one for simplicity. For Set A priors 
the generalization to other cases is trivial. 

To relax the first-term approximation we include the 
first four omitted terms in our computation of A k . The 
result is then converged numerically in all cases shown, 
so in practice Set (Y = A, B, or C) priors lead to 
the same results as when arbitrarily many higher-order 
terms included in the truncation-error calculation. In 
consequence, we do not include superscripts below when 
reporting results with terms beyond the first omitted one 
included in the computation of A k . Such calculations 


show that the central plateau in the posterior becomes 
rounded (blue curve in Fig. [l]). The corresponding effect 
on the DOB intervals depends on the value of for k > 2 
there is no significant effect on the 68 % DOB intervals 
while the 95% intervals are increased slightly. 

If we use the more informative log-normal prior for c 
from set B, the tails are more quickly suppressed than for 
set A e , to a degree that depends in detail on the values of 
a and Q mi. Representative examples for A 2 are shown 
in Figs. [ 2 ] and [3] for Q = 0.2 and Q = 0.5, respectively, 
with a = 0.25, 0.5, and 1.0 respectively. We see that the 
68 % DOB and 95% DOB intervals are smaller that those 
for A e , with the difference increasing with smaller a. The 
further extension of the tail for A e is not surprising as we 
have allowed for the possibility of c having a large range. 
As a gets larger, the posteriors for each Q value approach 
the A e result; once a > 1.0 there is very little difference 
between A e and Set B for k > 2 . k = 0 and k = 1 are 
more sensitive to a. 

We might expect that the Set B results with a > 1.0 
will be even closer to those from Set A if we impose a 
range of c values in Set A that reflects naturalness ex¬ 
pectations. Results of varying the range over which c is 
marginalized are shown in Table [TTJ where we compare 
DOB intervals for A k with k from 0 to 4. For k > 3, the 
change in the range of c has no noticeable effect on either 
the 68 % or 95% DOB. For k = 2 , effects are 5-10% on 
the 68 % interval if a narrow range (c from 0.5 to 2.0) is 
employed. Effects on the 95% interval can be up to 20% 
if this narrow range is employed at k = 2 . 

The Set C priors are qualitatively different from Set A 
because they correspond to an ensemble naturalness as¬ 
sumption for pr(Q|c), which means that the distribution 
of c n ’s for a given k —and not just their maximum— 
affects the result. This is illustrated by the results in Ta- 
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pr( A o I c 0 ) 


pr(A 2 I c Q , c r c 2 ) 


pr(A 4 I c Q , c r c 2 , c 3 , c 4 ) 



FIG. 1. (color online) Posteriors for under Set A n1 priors of Table [l] lor k = 0,2, and 4 with Q = 0.33. In all cases, e„ = 1 
was assumed. The solid red curve is the analytic result from Eq. (21), with the shaded regions marking the 68% and 95% DOB 
intervals. The solid blue curve is the posterior A& for A e once higher-order terms are included, with dot-dashed and dashed 
lines marking the corresponding 68% and 95% DOB intervals. 


pr(A 2 I c Q , Cj, c 2 ) pr(A 2 I c () , Cj, c 2 ) pr(A 2 I c Q , c r c 2 ) 



FIG. 2. (color online) Comparison of A 2 for prior set A e (red solid line) and B (blue solid line) for cr & 0.25, 0.5 and 1.00 
respectively. In each case co,ci,C 2 are all set to unity with an expansion parameter Q = 0.2. The DOB intervals are indicated 
as in Fig. [l] 

pr(A 2 I c Q , Cj, c 2 ) pr(A 2 I c () , Cj, c 2 ) pr(A 2 I c Q , c r c 2 ) 





FIG. 3. (color online) Comparison of A 2 for prior set A e (red solid line) and B (blue solid line) for three values of a and 
co,ci,C 2 all set to unity with an expansion parameter Q = 0.5. The DOB intervals are indicated as in Fig. [l] 


ble III, in which DOB intervals for A 2 with prior choices 
A e and C e are compared. Because k = 2 , the coefficients 
Co, ci, and C 2 are all influential; we consider three repre¬ 
sentative choices for their values. The systematics going 
from c n sets a to b to c show that having more c n ’s near 
one leads to larger DOB intervals. Taking c n (b) to give 


generic results for a roughly even distribution of coeffi¬ 
cients we find that the Set C vs. Set A comparison is close: 
only a 10-15% increase for the 68% DOB and a roughly 
20% increase for the 95% DOB. (The Set A intervals are 
wider for all but the case in which all three known coeffi¬ 
cients are 1.0.) This reflects the stronger central peaking 
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pr(A 2 I c 0 =1.0, c^l.O, c 2 =1.0) 



pr(A 2 I c Q =1.0, <^=0.5, c 2 =0.1) pr(A 2 I c Q =1.0, c^O.l, c 2 =0.1) 



FIG. 4. (color online) Comparison of A 2 for prior set A e ^ (red solid line) and C e ^ (blue solid line)—note both are in 
the leading-omitted-term approximation—for three sets of c n values with an expansion parameter Q = 0.33. These sets are 
(left-to-right) {c n }(a) = {1.0,1.0,1.0}, {c n }(5) = {1.0, 0.5, 0.1}, and {c n }(c) = {1.0, 0.1,0.1}. The DOB intervals are indicated 
as in Fig. [l] 


TABLE II. Resulting 68% and 95% DOB intervals for A& using Set A^ with different choices for the minimum and maximum 
of the c prior. In all cases C( fe ) = 1. 



min / max 

Q 

k = 0 

k = 1 

k = 2 

k = 3 

k = 4 


0.001/1000 


0.31 

0.041 

0.0073 

0.00136 

0.00026 


0.25/4.0 

0.20 

0.22 

0.039 

0.0072 

0.00136 

0.00026 


0.50/2.0 


0.18 

0.035 

0.0068 

0.00132 

0.00026 


0.001/1000 


0.51 

0.111 

0.033 

0.0101 

0.0032 

68% 

0.25/4.0 

0.33 

0.36 

0.106 

0.032 

0.0101 

0.0032 


0.50/2.0 


0.30 

0.095 

0.030 

0.0098 

0.0031 


0.001/1000 


0.78 

0.26 

0.113 

0.053 

0.026 


0.25/4.0 

0.50 

0.55 

0.243 

0.112 

0.053 

0.025 


0.50/2.0 


0.45 

0.22 

0.106 

0.051 

0.025 


0.001/1000 


1.96 

0.103 

0.0137 

0.0023 

0.00041 


0.25/4.0 

0.20 

0.47 

0.077 

0.0129 

0.0022 

0.00041 


0.50/2.0 


0.29 

0.056 

0.011 

0.0020 

0.00039 


0.001/1000 


3.2 

0.28 

0.0614 

0.0168 

0.0050 

95% 

0.25/4.0 

0.33 

0.77 

0.21 

0.058 

0.0166 

0.0050 


0.50/2.0 


0.48 

0.152 

0.048 

0.0150 

0.0047 


0.001/1000 


4.91 

0.645 

0.21 

0.0884 

0.040 


0.25/4.0 

0.50 

1.16 

0.48 

0.201 

0.087 

0.040 


0.50/2.0 


0.73 

0.35 

0.166 

0.079 

0.038 


of the Set C pdf under a reasonable distribution of the 
first three coefficients, as depicted (in the first-omitted- 
term approximation) in Fig. [4j Such differences in DOB 
intervals under different prior choices will be amplified if 
k = 0 or k = 1. 


Table |III| also assesses the approximation of keeping 
only the leading omitted term in A 2 . Once Q = 0.5 
we see appreciable differences between Set C^ and Set 
C results that include multiple higher-order terms, but 
even then it is only about a 15% effect on the error bar. 


III. COMPARISON TO RECENT CHIRAL EFT 
RESULTS FOR NP SCATTERING 

A. EKM’s truncation-error estimates 

Chiral perturbation theory (yPT) encodes the conse¬ 
quences of QCD at momenta of order the pion mass Ha¬ 
ng. it can be used to compute the interaction of single 
nucleons and pions for momenta well below the chiral- 
symmetry-breaking scale, A x sb- yPT yields a purely 
perturbative expansion in powers of (p, m n )/A x sB for 
low-energy pion-pion and pion-nucleon scattering. But, 
nuclei are bound states, and will not be generated from 
such an expansion. 

In the early 1990s Weinberg pointed out that the in¬ 
frared enhancement associated with multi-nucleon inter¬ 
mediate states meant that the yPT expansion cannot be 







































































































12 


TABLE III. DOB intervals for Sets A e and C e , with and without the leading omitted term approximation, for k — 2, with 
three sets of c n values. These sets are (c n }(a) = {1.0,1.0,1.0}, {c n }(5) = {1.0, 0.5,0.1}, and {c n }(c) = {1.0,0.1,0.1}. 



Q 

AP/A e 

cP/C e {c n }(a) 

cP/C e (c„}(6) 

C) 1} /C £ {c n }(c) 


0.20 

0.0073/0.0073 

0.0095/0.0097 

0.0062/0.0063 

0.0056/0.0057 

68% 

0.33 

0.033/0.033 

0.043/0.045 

0.028/0.029 

0.025/0.026 


0.50 

0.113/0.123 

0.149/0.171 

0.096/0.111 

0.087/0.100 


0.20 

0.0137/0.0137 

0.025/0.026 

0.017/0.017 

0.015/0.015 

95% 

0.33 

0.061/0.066 

0.114/0.121 

0.074/0.079 

0.067/0.071 


0.50 

0.21/0.25 

0.40/0.46 

0.26/0.30 

0.23/0.27 


applied directly to the scattering amplitude in systems 
with more than one nucleon [24]. He argued that the xPT 
Lagrangian and counting rules should instead be used to 
compute an NN (or NNN or ...) potential up to some 
fixed order, n, in xPT. Such an expansion can then be 
examined for convergence with n. The x?T potential V 
was computed to 0(Q 3 ) in Refs. [25H27] . and to 0(Q 4 ) in 
Refs. [28, 29,. Consistent three-nucleon forces have been 
derived and implemented in such an approach [30] E]. 

However, while there is a xPT expansion for V , the re¬ 
sulting nuclear binding energies (and other observables) 
contain effects to all orders in the chiral expansion: there 
is no obvious perturbative expansion for them. In prac¬ 
tice, chiral EFT for few-nucleon systems is often imple¬ 
mented as described in the previous paragraph, but with 
the Hamiltonian acting only on a limited space: in mo¬ 
mentum space a cutoff A in the range 450 < A < 800 
MeV must be imposed [32] • From now on when we use 
the term chiral EFT in the context of few-nucleon sys¬ 
tems we mean calculations that are carried out in this 
way. A formal justification of the Q-expansion (e.g., 
via the distorted-wave Born approximation evaluation of 
higher-order contributions [331435] or use of a relativis¬ 
tic propagator [Ml E7]) requires a more sophisticated 
power counting [3814401 . Nevertheless, in practice, the 
convergence of chiral EFT calculations for observables 
can be examined a posteriori to see if they inherit the 
Q-expansion that has been used for the potential. 

In two recent papers, EKM estimated the errors that 
arise from truncation of the chiral EFT expansion at a 
finite order [T 0 l HI] (see also Ref. M)- Similar prescrip¬ 
tions have previously been used in other EFT contexts 
(see, e.g., Refs. [42l [43]). Such estimates apply to in¬ 
dividual observables (such as the total cross section for 
neutron-proton scattering at a given lab energy or nu¬ 
cleon electric and magnetic polarizabilities). They are 
independent of procedures used to fit LECs to two-body 
scattering data at each order. While Bayesian analysis 
could also be applied to those procedures that is not our 
concern here; it will be the focus of a future publica¬ 
tion [18]. 

Instead, EKM assume that the EFT expansion holds 
for individual observables X(p), i.e., 

k 

X(p)=X 0 ^c n (p)Q n , (31) 

n=0 


with Q the EFT expansion parameter, and c\ = 0 in 
the Weinberg expansion for NN scattering in chiral EFT. 
Cumulative sums at LO, NLO, N 2 LO, N 3 LO, and N 4 LO 
are then given by: 

X LO (p) = c 0 (p), (32) 

V nlo (p) = Yi c n{p)Q n , (33) 

n=0 

3 +1 

X NJLO (p) = Y, Cn(p)Q n , j = 2 ,3,4 . (34) 

n=0 

EKM also assume that the dominant error at order 
k comes from the first omitted—(fc + l) th —term. Two 
ingredients go into their estimate of this term. The first 
is to identify the EFT expansion parameter Q, defined 
as 


0 s m “ (a? Tf) ' (35) 

Note that, in contrast to pQCD, this is a momentum- 
dependent expansion parameter, and so the expansion 
will perform differently at different kinematic points. 
Furthermore, to know Q we must identify A&, the break¬ 
down scale of the EFT. In Refs. mm, EKM estimate 
A 5 from error plots of the fit phase shifts. The second 
ingredient is to determine the shift beyond N J LO as: 

AV NiLO = ^ +2 max(|c 0 |, | Cl |.|c j+1 |) , (36) 


where the c n ’s are defined as above. 

In Refs. [ 10 , [TT] the expressions for the theory error 
are defined via differences of the partial sums (34), but 
the result may be summarized compactly according to 
Eq. (36). The similarity of this prescription to the sim¬ 
plest analytic form obtained above with A^ 1 ) priors, the 
CH procedure written in Eq. (25), is evident. For a given 
observable, the value of Q that is identified defines the 
perturbative expansion parameter, and the EKM uncer¬ 
tainty is the maximum coefficient times the first omitted 
power of Q. Up to factors of order unity, this is what 
Eq. (25) predicts for the 68 % (“1 a”) DOB interval. There 
is then clearly a semi-quantitative correspondence. We 
now make a quantitative comparison using the various 
priors from Table [I] 

We note that in chiral EFT, having posited a Q ex¬ 
pansion, we do not find the coefficients directly but ex¬ 
tract them from the calculations at different orders. In 
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TABLE IV. Order-by-order calculations for cr np in mb for 
R — 0.9 fm from EKM TO, [lT| '44 j. Lab energy Ti a b and 
relative momentum p re i are in units of MeV. 


TABLE VII. Dimensionless coefficients from the expansion of 
(j n p — a LO En= 0 CnQ n for R — 1.2 fm from Table [v[ with 
Q = Prei/400 MeV. 


Tl a b 

Prel 

OLO 

ONLO 

cr N 2 LO 

C’nSLO 

cr N 4 LO 

Tl a b 

co 

Cl 

c 2 

C3 

C4 

C5 

50 

153 

183.6 

166.5 

167.0 

166.8 

167.5 

50 

1.0 

0.0 

0.23 

0.09 

0.47 

0.54 

96 

212 

84.8 

75.1 

78.3 

77.5 

78.0 

96 

1.0 

0.0 

0.51 

0.27 

1.43 

0.16 

143 

259 

52.5 

49.1 

54.2 

53.7 

53.9 

143 

1.0 

0.0 

0.60 

0.33 

2.07 

0.03 

200 

307 

34.9 

35.9 

42.6 

43.2 

42.7 

200 

1.0 

0.0 

0.52 

0.32 

2.28 

-0.07 


TABLE V. Order-by-order calculations for cr np in mb for R = 
1.2 fm from EKM [101 [TTl 144] . Lab energy Ti a b and relative 
momentum p re i are in units of MeV. 


Tl a b 

Prel 

CLO 

CNLO 

cr N 2 LO 

CnSLO 

cr N 4 LO 

50 

153 

159.4 

164.8 

165.6 

167.2 

167.9 

96 

212 

60.2 

68.9 

71.3 

78.1 

78.5 

143 

259 

30.8 

38.6 

41.4 

52.6 

52.7 

200 

307 

17.2 

22.5 

25.0 

38.6 

38.3 
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C, 


Prel 

- 307 MeV 

- 259 MeV 

- 212 MeV 

- 153 MeV 


0.5 1 1.5 2 2.5 


contrast, for the QCD expansions, the coefficients are 
calculated independently of each other. Thus the EFT 
application will require additional empirical verification, 
see Sec. IIV1 


FIG. 5. (color online) Chiral EFT expansion coefficients from 
Table |VI| for a np at four different energies using potentials 
with regulator parameter R = 0.9 fm and A^ = 600 MeV. 
Note that the coefficient C 5 = 3.5 at Ti a b = 50 MeV is off 
scale. 


B. The pattern of EFT coefficients in EKM’s result 

In Refs. [IQJ and [44], results for the neutron-proton 
total cross section at various energies are given for sev¬ 
eral values of a coordinate-space regulator parameter R. 
These provide empirical tests of the priors from Table [I] 
The order-by-order cross sections are given in Tables [TV] 
and [Vi for R = 0.9 fm and R = 1.2 fm, respectively. In 
line with Eq. 0 , we write the cross section at order k in 
the chiral EFT expansion as 

k / \ n 

&np{E lab) ~ ^ref ^ ^ C n f ~jT~ ) 5 (37) 

n=0 ' k/ 

where cr re f is a reference cross section that might be taken 
as ctlo? as we do here, or the N 4 LO result, or the ex¬ 
perimental value. The analysis is not sensitive to this 
choice. The breakdown scale A5 was identified in Ref. uni 
as A 5 = 600 MeV for cutoffs R = 0.8, 0.9, and 1.0 fm, 
A 5 = 500 MeV for R = 1.1 fm, and A 5 = 400 MeV for 


TABLE VI. Dimensionless coefficients from the exp ans ion of 


(Jnp — ^loELo Cn( ? n f° r R — 0.9 fm from Table 
Q = Prel /600 MeV. 


IV 


with 


Tlab 

Co 

Cl 

c 2 

c 3 

C 4 

C 5 

50 

1.0 

0.0 

-1.43 

0.16 

-0.26 

3.5 

96 

1.0 

0.0 

-0.92 

0.86 

-0.61 

1.07 

143 

1.0 

0.0 

-0.35 

1.21 

-0.27 

0.25 

200 

1.0 

0.0 

0.11 

1.44 

0.25 

-0.41 


R = 1.2 fm. Note that this decrease in A& with increasing 
R corresponds to the change in the regulator cutoff scale 
rather than a change in the intrinsic underlying break¬ 
down scale. In Ref. @] it was emphasized that resid¬ 
uals for a /cth-order EFT calculation had two types of 
errors: regulator artifacts dictated by the imposed cutoff 
A 1/R in this context) and truncation errors in the 
Hamiltonian dictated by the underlying breakdown scale 
Afc. EKM do not make this distinction in their notation, 
i.e., they use A 5 for both. 

Under the EKM choices for A&, the dimensionless c n 
coefficients are given in Tables [Vi] and |vn]and Figs. [5] 
and [ 6 ] for the R=0.9 fm and R= 1.2 fm cases respectively. 
Although the coefficients in both cases are natural, rather 
different patterns are seen. As discussed and illustrated 
by EKM (e.g., see Fig. 2 of Ref. [H]), the softer cutoff 



FIG. 6 . (color online) Chiral EFT expansion coefficients from 
Table |VI| for cr np at four different energies using potentials 
with regulator parameter R = 1.2 fm and Ab = 400 MeV. 
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shifts contributions between different chiral orders so that 
the systematic pattern of corrections is disrupted. In par¬ 
ticular, corrections at orders N 2 LO and N 4 LO, which are 
purely from non-analytic terms in the chiral expansion, 
become heavily regulated by the soft cutoff. This has 
the consequence that the corresponding coefficients are 
anomalously small—which may, in turn result in N 3 LO 
coefficients being somewhat large. This pattern is seen 
in Fig.[6j but what is shown there is insufficient to defini¬ 
tively establish there is an inter-order correlation due to 
regulator artifacts. Here we focus on the R = 0.9 fm 
example to ensure that the pattern is primarily driven 
by the inheritance of naturalness from the fit low-energy 
constants (LECs), and not by regulator artifacts that 
spring from a choice of R that makes the A ~ 1 /R effects 
predominate over the physics at that was integrated 
out of the theory. 


C. DOB intervals from a Bayesian analysis 


There is a minimum of necessary information that 
must exist between the prior and data in order for the 
resulting posterior to accurately describe the above dis¬ 
tributions. Two extremes exist: a large supply of precise 
and accurate data paired with an uninformative prior 
(or, even worse, an informative yet incorrect prior) and 
a small amount of data paired with a precisely and accu¬ 
rately defined prior. Each of these situations may result 
in realistic posteriors as lack of information in one realm 
is compensated by abundance in the other. In practice, 
though, we conduct analyses between these extremes. We 
are often able to define a reasonable, and appropriately 
loose, prior that is subsequently fine-tuned by a mod¬ 
est amount of data. We will now show that each of the 
priors defined in Table [I] may be considered reasonable 
representations of naturalness in the EFT-coefhcient dis¬ 
tribution obtained in the previous subsection. The DOB 
intervals that result from Bayesian analyses using these 
priors show agreement and increased similarity at low 
Q and high k —where the strength of available data is 
greatest. 

In Fig. [7| cross sections from Table |TV| for R = 0.9 fm 
at four different energies are plotted order-by-order in 
the chiral expansion, with error bars indicating the 68% 
and 95% DOB intervals if we adopt prior set A e . The 
LO / error bars are from the calculation for the posterior 
of Ao while the LO error bars are from the posterior of 
Ai. When calculating Ai we have k = 1 and n c = 1, 
so the resulting error bar is simply Q times the LO 7 one. 
This is the correct error estimate for a LO chiral EFT 
calculation of NN scattering, as long as we know a priori 
that the coefficient c\ in the expansion (37) is identically 
zero. 

Cross sections at subsequent orders generally fall 
within the DOB intervals of lower-order error analyses— 
in accord with the DOB intervals’ statistical interpre¬ 
tation. The order-by-order decrease in the error bars 


primarily reflects the additional factors of Q with each 
successive order. The very conservative assumption for 
pr(c) used here, which encodes ignorance of its scale even 
though we anticipate naturalness, leads to long tails in 
the posterior for the lowest orders and correspondingly 
large 95% DOB intervals— further reflecting the non- 
Gaussian nature of this distribution. When we use a 
form for pr(c) that reflects the expectation of natural¬ 
ness, the long tails are suppressed and the 95% DOB 
intervals are closer to “2cr” errors, although the pdfs do 
remain non-Gaussian in general. 


Table VIII| is an analytical compilation of DOB inter¬ 
vals in the limiting case of A e with the leading-term ap¬ 
proximation, see Eq. (25). The top number in each cell 
has been calculated using coefficients of Table m which 
are scaled with the cr re f corresponding to each energy. 
The lower number is the resulting DOB interval in units 
of mb with the factor of a re f = <jlo included. The 95% 
DOB interval being more than 6 times broader than the 
corresponding 68% DOB interval at LO / and LO empha¬ 
sizes the strength of tails within these posteriors. 

Representative numerical results for the various prior 
sets are given in Table |IX| Though we omit mention of 
cr re f, values here should also be multiplied by the energy- 
appropriate cr re f (i.e., olo) to obtain DOB values in units 
of mb. Systematically, we observe that the ratios of DOB 
intervals between prior sets are the same across all 4 en¬ 
ergies for LO 7 and LO as all ci’s are 0 and we have scaled 
all Co’s to the value 1.0. Thus, given the same set of coeffi¬ 
cients, all posteriors scale similarly with energy. Table |IX| 
also shows that the ensemble prior in Set C e generally 
predicts 68% DOB intervals quite similar to those from 
Set A e , with much greater variation for 95% DOB inter- 



FIG. 7. (color online) Cross sections at different energies and 
orders from EKM, with DOB intervals at each order using Set 
A e priors. The thick error bars indicate 68% DOB intervals 
while the thin error bars indicate 95% DOB intervals. 
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TABLE VIII. DOB intervals for EKM cr np with R — 0.9fm using prior Set Ae% 


DOB 

Elab 

Q 

LO' 

LO 

NLO 

n 2 lo 

n 3 lo 

n 4 lo 

68 % 

50 

0.255 

0.40 

0.102 

0.024 

0.0055 

0.0013 

0.00079 

x 183.6 mb 

73. 

19. 

4.4 

1.0 

0.24 

0.15 

68 % 

96 

0.354 

0.55 

0.20 

0.045 

0.0142 

0.0047 

0.0017 

x 84.8 mb 

47. 

17. 

3.8 

1.2 

0.40 

0.15 

68 % 

143 

0.432 

0.68 

0.29 

0.082 

0.038 

0.015 

0.0064 

x 52.5 mb 

35. 

15. 

4.3 

2.0 

0.81 

0.34 

68 % 

200 

0.511 

0.80 

0.41 

0.136 

0.089 

0.043 

0.021 

x 34.9 mb 

28. 

14. 

4.7 

3.1 

1.5 

0.73 

95% 

50 

0.255 

2.6 

0.650 

0.061 

0.0103 

0.0022 

0.0012 

x 183.6 mb 

470 

120 

11 . 

1.9 

0.40 

0.23 

95% 

96 

0.354 

3.5 

1.25 

0.115 

0.027 

0.0079 

0.0027 

x 84.8 mb 

300 

110 

9.8 

2.3 

0.67 

0.23 

95% 

143 

0.432 

4.3 

1.87 

0.21 

0.072 

0.026 

0.010 

x 52.5 mb 

230 

98. 

11 . 

3.8 

1.4 

0.53 

95% 

200 

0.511 

5.1 

2.6 

0.35 

0.17 

0.071 

0.033 

x 34.9 mb 

180 

91. 

12 . 

5.9 

2.5 

1.1 


TABLE IX. DOB intervals for EKM a np scaled by ctlo with R = 0.9fm. Results for prior Sets A e , B (with a = 1.0), and C e , 
all without the leading-omitted-term approximation. 



set 

Elab 

Q 

LO' 

LO 

NLO 

n 2 lo 

n 3 lo 

n 4 lo 


A e 



0.43 

0.11 

0.025 

0.0055 

0.0013 

0.00080 


c e 

50 

0.255 

0.48 

0.12 

0.028 

0.0053 

0.0011 

0.00056 


B 



0.29 

0.073 

0.022 

0.0052 

0.0013 

0.00076 


A e 



0.59 

0.21 

0.048 

0.015 

0.0048 

0.0018 


C e 

96 

0.354 

0.69 

0.25 

0.060 

0.019 

0.0058 

0.0021 

68 % 

B 



0.40 

0.143 

0.043 

0.014 

0.0047 

0.0017 

A e 



0.74 

0.32 

0.089 

0.040 

0.016 

0.0067 


Ce 

143 

0.432 

0.87 

0.38 

0.088 

0.043 

0.015 

0.0059 


B 



0.51 

0.22 

0.080 

0.038 

0.016 

0.0065 


Ae 



0.91 

0.46 

0.15 

0.097 

0.046 

0.022 


Ce 

200 

0.511 

1.08 

0.58 

0.14 

0.096 

0.041 

0.019 


B 



0.63 

0.32 

0.14 

0.091 

0.044 

0.022 


Ae 



2.7 

0.69 

0.066 

0.011 

0.0023 

0.0013 


Ce 

50 

0.255 

3.3 

0.85 

0.089 

0.014 

0.0027 

0.0013 


B 



0.67 

0.172 

0.042 

0.0091 

0.0021 

0.0012 


Ae 



3.8 

1.3 

0.13 

0.030 

0.0088 

0.0030 


Ce 

96 

0.354 

4.8 

1.7 

0.20 

0.050 

0.0142 

0.0049 

95% 

B 



0.97 

0.34 

0.088 

0.026 

0.0083 

0.0029 

Ae 



4.7 

2.0 

0.24 

0.083 

0.030 

0.012 


Ce 

143 

0.432 

6.0 

2.6 

0.29 

0.114 

0.038 

0.014 


B 



1.22 

0.53 

0.17 

0.071 

0.028 

0.0115 


Ae 



5.7 

2.9 

0.41 

0.20 

0.088 

0.041 


Ce 

200 

0.511 

7.4 

3.8 

0.47 

0.26 

0.100 

0.043 


B 



1.53 

0.78 

0.29 

0.173 

0.081 

0.040 


vals for the lower orders. From this, we see that prior 
choice affects the structure of the tails more significantly 
than the structure of the peak. This is indicative of the 
strength of information coming from the data and the 
prior at different points in the distribution. Though Set 
B results in significantly narrower DOB intervals at low 
fc, the EFT coefficients provide enough information for 
fc > 2 to modify these posteriors into agreement with 
those of Sets C e and A e . 


A comparison of Set A results in Table [iX] with those in 
Table VIII shows that the approximation of keeping only 


the leading omitted term is excellent for the 68 % DOB 
for k > 2 and still quite good for fc = 1 (which is the 
true leading order). This approximation always underes¬ 
timates the interval from including higher-order terms 
and worsens as the expansion parameter Q increases. 
Figures [ 8 ] and [9] show that this result is general and that 
the approximation is better for the 95% interval with a 
less conservative prior for c. One outlier is the fc = 5 pre¬ 
diction at Ti a b = 50 MeV where we see consequences of a 
C 5 coefficient known to have an anomalously large value, 
which is an artifact of the fitting procedure m ■ Note 
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FIG. 8. (color online) Cross sections at 50MeV and 96MeV for all orders from EKM, with DOB intervals at each order using 
a wide variety of prior sets. Note the change in scale at each order. The thick error bars indicate 68% DOB intervals while 
the thin error bars indicate 95% DOB intervals. In each panel the dashed line is the result of the next-order calculation (NLO 
at LO, N 2 LO at NLO, etc.), shown to facilitate an assessment of the statistical consistency of different prior choices. For each 
prior choice, the intervals on the left are from keeping only the first omitted term while those on the right are including four 
omitted terms. The shaded bands indicate the uncertainty from EKM. 


that this results in the omission of the DOB interval for 
N 4 LO at 50 MeV with Set Ao. 5-2 as c> is then less than 
C(fc), so the distribution is not defined in this case. 

Overall, the prior sets A e and C e appear to be too con¬ 
servative for predictions at LO; we know that A e and C e 
have incorporated less information than the alternatives 
so it is no surprise that their posteriors are more widely 
distributed. Importantly, we find that the posteriors for 
for k > 2 are largely insensitive to the choice of prior, 
even for the 95% DOB interval. As posteriors retain ar¬ 
tifacts of the prior in inverse proportion to the strength 
of the data, this similarity suggests that the data is suffi¬ 
ciently informative that any reasonable prior is properly 
subservient and thus able to adapt to evidence of the real 
world presented by the data. 


IV. CHOICE OF EXPANSION PARAMETER 

In the previous section, the scale A& in the expansion 
parameter was taken from Ref. uni, where it was ex¬ 
tracted from error plots after the fit of the LECs. This 
identification was certainly not rigorous in any statistical 
sense. Therefore here we explore how A 5 can be extracted 
from the convergence pattern of the EFT for observables. 

In the case of pQCD, Cacciari and Houdeau discussed 
using an expansion parameter that is different from a s . 
They introduced a scale factor A, so that the expansion 
is in powers of a s /X [ 6 ]. This changes the expressions for 
pr(A/e|co ..., Cfc) by a rescaling of the expansion param¬ 
eter Q and a corresponding rescaling of the coefficients 
themselves. We can rewrite the series for an observable 
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FIG. 9. (color online) Cross sections at 143 MeV and 200 MeV for all orders from EKM, with DOB intervals at each order 
using a wide variety of sets. Note the change in scale at each order.The thick error bars indicate 68% DOB intervals while the 
thin error bars indicate 95% DOB intervals. In each panel the dashed line is the result of the next-order calculation (NLO at 
LO, N 2 LO at NLO, etc.), shown to facilitate an assessment of the statistical consistency of different prior choices. For each 
prior choice, the intervals on the left are from keeping only the first omitted term while those on the right are including four 
omitted terms. The shaded bands indicate the uncertainty from EKM. 


X in terms of the rescaled expansion parameter and co¬ 
efficients as 


X = X 0 ]T(c„A n ) x 

n=0 



(38) 


In an EFT expansion this is equivalent to a rescaling of 
A 5 by a factor A. 

Subsequent papers explored procedures for determin¬ 
ing the value of A based on various criteria: 


In Refs. [3[ 8 ], A was chosen empirically by compar¬ 
ing the consistency of the computed DOB intervals 
with known higher-order calculations. An extra 
factor of (n — 1)! was also introduced along with A 
in Eq. (38)—motivated by effects from renormalon 


chains at higher orders in the expansion. The au¬ 
thors denoted the resulting scheme CH. We have 
no evidence for such a factorial in our EFT expan¬ 
sions and do not consider it further here. 

• In Ref. [45] , it was proposed that with the best ex¬ 
pansion parameter, the coefficients should form a 
normal distribution of mean /i and standard devi¬ 
ation a. This criterion was used to choose a value 
of A. This approach is consistent with naturalness 
for the {c n }, as long as (i and a are both 0(1). 

Here we explore these procedures for tuning the ex¬ 
pansion parameter in the EKM cross sections, and we 
also suggest another criterion for assessing A based on 
the assumption of naturalness in the EFT expansion for 




































































































18 


a particular value of A*,. If a A emerges from such anal¬ 
yses that is measurably different from one, it suggests 
that the true breakdown scale of the EFT expansion is 
not A*,, but instead A^A. Given the limited number of 
coefficients (20 at most) at our disposal from the EKM 
analysis, any statistical procedure can only determine A, 
and hence A*,, within sizable error bars. Our goal in this 
section is less to determine A&, than to establish whether 
the choice A& = 600 MeV is consistent with our other a 
priori assumptions and deductions about the convergence 
properties of the EFT. 


A. Consistency checks based on higher-order 
calculations 

In Refs. [JJ [S3 A was determined by checking the consis¬ 
tency of CH DOB intervals obtained with expansion pa¬ 
rameters a s /\ in several large sets of pQCD observables. 
This is done by examining actual vs. expected success 
rates of the pQCD calculations. As stated in Ref. [[46] : 
“For a finite set of observables and a given model (with 
fixed parameters) at order fc, the success rate is defined as 
the number of observables whose subsequent-order con¬ 
tributions are within the uncertainty interval predicted 
by the model.” 

We want to use the observed success rates n(p)/N for 
our EFT calculation to infer the likelihood that p is the 
true success rate—for many different choices of p. If each 
observable being considered is uncorrelated, the success 
rate should follow a binomial distribution. Therefore the 
likelihood for n successes amongst N observables, given 
P, is 


P'WP. W > = n!( n)! ^(l - P)"-" ' (39) 

We generalize the pdf ( |39| ) to its continuous version, the 
/^-distribution: 

pr(a, b\p, N ) = (i _ p f -1 ; (40) 

with a = n + 1 and b = N — n + 1. We can then compute 
confidence intervals (CIs) on n (or, equivalently a) for 
a given value of p (in practice we will consider only the 
68 % and 95% CIs). This can be done using standard 
integrals, and the result expressed in terms of a range of 
success rates that are consistent with the chosen value of 
P • 

As in Refs. mm, we have calculations of the cross 
sections at several orders and energies and are trying to 
determine values of A that result in consistency between 
assumed values of p and the resulting success rates n. 
To do this, we take the set of 16 observables we have 
from the EKM results: calculations at LO, NLO, N 2 LO, 
N 3 LO for four different lab energies. (Note that each 
observable must have a higher-order result to which it 
can be compared.) We then pick a value of A and proceed 


to assess the consistency of the success rates of the theory 
predictions for that A via this algorithm (adapted from 
pQCD to EFT for our purposes): 

1. Select a grid of p% DOB intervals with p ranging 
from 0 to 100 . 

2 . Use the formalism laid out in Sections ITTI and 1TTT1 to 
compute the p% DOB interval for each observable 
in the set. 

3. For each next-order calculation that is within the 
DOB interval of the previous order, count one suc¬ 
cess. 

4. Take the number of successes and divide by the to¬ 
tal number of observables to get the actual success 
rate. 


5. Compare the actual success rate for this value of p 
with the 68 % and 95% confidence intervals for the 
number of successes if p were the true success rate, 
as computed from the distribution (40). 


This algorithm generates a function of p for this value 
of A. If the curve is within the 68 % Cl for the entire 
range of p values we say that the value of A is consistent 
at la with the performance of the perturbative series. 
Moderate fluctuations outside the la band over limited 
regions of the entire p domain can indicate a statistically 
consistent choice for A, but the concern is with curves 
that end up systematically outside the la region. This 
can occur in one of two ways. If the curve starts to veer 
above the la region then that indicates the EFT pre¬ 
dictions are too successful. The expansion parameter is 
overestimated, which means the EFT breakdown scale is 
underestimated. Alternatively, the function n(jp)/N may 
deviate well below the 68 % Cl, in which case the EFT 
is under-performing compared to statistical expectations. 
In that case the stated expansion parameter is too small, 
i.e., A 5 is overestimated. We note that this interpreta¬ 
tion is somewhat specific to EFT: in a case where we 
were confident of the expansion parameter in the series 
we could instead use this diagnostic to probe whether 
different prior choices are too conservative or too aggres¬ 
sive. 

Here though, we try to draw conclusions on the perfor¬ 
mance of the EFT expansion that are invariant under the 
choice of priors defined above. We thus implement this 
procedure for two different prior assumptions on c and 
the coefficients {c n }. In each case we use the approxima¬ 
tion that the leading term dominates for computational 
ease. The curves do not change substantially if we go 
beyond the first-omitted-term approximation. 

We first consider Set A^. The results of computing 
p% success rates for various values of A are shown by the 
lines in Fig. [lO] We include 68 % and 95% confidence 
bands to evaluate which A curves meet our consistency 
criterion. With only 16 observables, the confidence bands 
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p% 


FIG. 10. (color online) Empirical determination of A by com¬ 
paring results at different orders. The cross sections used are 
the computations with the R — 0.9 fm regulator. Priors are 
Set Ae X \ For full explanation see text. 



FIG. 11. (color online) Empirical determination of A by com¬ 
paring results at different orders. The cross sections used are 
the computations with the R — 0.9 fm regulator. Priors are 
Set Ce^. For full explanation see text. 


are fairly wide, but still the only curve which falls com¬ 
pletely within the 68% interval is A = 1.3. The orig¬ 
inal expansion parameter at A = 1 spends some time 
above the la region, which may reflect that DOB inter¬ 
vals resulting from this prior are too conservative; i.e., 
the actual success rate regularly exceeds the DOB that 
has been assigned. This is consistent with our earlier ob¬ 
servation that Set A e priors produce overly conservative 
DOB intervals. 

We also compute the intervals using Set C^, which ac¬ 
counts for the effects of each coefficient and is less conser¬ 
vative. The results are contained in Fig. El We see that 
even for these assumptions, the A = 1 curve gets outside 
the la band. The plot suggests A = 1.1 is a more con¬ 
sistent choice (other values near A = 1.1 will, of course, 
also be consistent). Because the DOB intervals computed 
with Set Cg 1 ) priors are more informed by the available 
coefficients, this result may suggest a small increase in 
the assigned breakdown scale is appropriate. However, 
we note the small amount of data on EFT convergence 
that is being used here; almost all rescalings considered 
are consistent at the 2a level. Such determinations of A& 
from success rates can be sharpened by considering the 
behavior of the EFT series for more observables. 


B. Gaussian naturalness and the Forte method 


is a mean and standard deviation, in this case a particular 
coefficient c n , the method of maximum entropy results in 
a distribution that is a gaussian [20, [22 q 

pt ( o„ | A,M,<7 ) = ^ e xp(-<ML^L) ^ (41) 


If we have several known coefficients, all of which are 
drawn from a distribution with the same mean and stan¬ 
dard deviation, the joint pdf pr(co,..., c&|A, /i, a) be¬ 
comes the standard likelihood function. If a = c and 
fi = 0 such a distribution corresponds to the Set C prior 
of Table U 

Forte et al. consider the probability distribution for 
both /i and A given a set of {c n } [45]. This can be ob¬ 
tained from © using Bayes’ theorem: 




pr(c 0 , ■ ■ ■, c fe |A, /i, <r) pr(A, 
pr(c 0 ,... ,c k \a) 


(42) 

Forte et al. assign no prior information to A and fi other 
than that both are larger than zero, and neither quantity 
depends on cr a priori. They then take the prior and the 
evidence in the denominator to be an overall normaliza¬ 
tion factor that is independent of A and /i, and so do 
not calculate them explicitly (cf. discussion of a scale- 
invariant prior for A below). The pdf for A and /i can 
then be written 


in Ref. [45], Forte et al. suggest that, for QCD expan¬ 
sions, the best A is the one that makes all the expansion 
coefficients closest to the same size, which they inter¬ 
pret as a statement that the coefficients should be nor¬ 
mally distributed around a single number /i with variance 
a 2 [45]. For a quantity for which the known information 


pr(A,^|c 0 ,...,c fc ,<r) oc pr(c 0 ,..., c k \X, /i, a) , (43) 


meaning that maximizing the probability of A and fi is 
equivalent to minimizing 


i= 1 n=0 \ / 


( 44 ) 
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where {c^n } is the set of EFT coefficients found for the 
ith observable, and No is the number of observables be¬ 
ing used to form the x 2 - In our case Nq = 4: the cross 
sections at the four different energies analyzed by EKM^] 
Note also that for chiral EFT for NN scattering the co¬ 
efficient ci is known to be zero, and so the n = 1 term 
should be omitted from the sum. 

The assumption that A has a uniform prior is not con¬ 
sistent with arguments regarding the invariance of the 
pdf under a change of scale [23j. In fact, A should be 
treated as a scale parameter. So, in contrast to Ref. US], 
we assign a uniform prior to the logarithm of A, resulting 
in a probability distribution for A and (i that is: 

pr(A, n\ca, ..., c k , cr) oc 1 x e - * 2/2 , (45) 


with the parameter space for A and (i restricted to both 
being positive. Assuming a = 1, we find the maximum 
of the probability (43) for the R = 0.9 fm EKM coef¬ 
ficients occurs at A = 0.92, /a = 0.69. To consider the 
pdf of A only, we marginalize over the parameter /a and 
maximize pr(A|co,..., c&, cr) to find A = 1.Oilrig? which 
is consistent with the Forte et al. hypothesis at a 68% 
DOB. Larger cr’s generate still wider ranges. From this 
point of view too, then, A& = 600 MeV is a consistent 
choice for the R = 0.9 fm np scattering EFT-expansion 
coefficients. 


C. x 2 test 

Alternatively, we can demand that the mean of the 
c n ’s be fixed at fi = 0 and that the width a should affect 
the results as in Set C gaussian pdfs on the coefficients, 
where c is an important feature of the prior. This leaves 
us with the probability 

pr(A, [i = 0|co,..., c fe , cr) oc I exp 

(46) 


/ x 2 (m = Q) ^ 


where X 2 (l J - = 0) is given by Eq. (44) with fi = 0. 

We can then test whether, for a given A, the data, i.e., 
the EKM coefficients from their R = 0.9 fm calculation, 
follows a normal distribution with mean zero and width 
cr. We do this by comparing x 2 (m = 0) with the way that 
X 2 should be distributed for a normal distribution with 
19 degrees of freedom.Once again, in order to do this we 
must fix cr. With the choice cr = 1 we find A = 1.09 
gives x 2 of 19—the central value one would expect for 
this many data points Using the rule of thumb for 


3 In general there would be Non c terms in the % 2 sum, but we 
omit the N 3 4 LO coefficient from the 50 MeV cross section, since 
it is clearly an outlier. Our y 2 thus has 19 terms in the sum. 

4 Including the N 4 LO coefficient from the 50 MeV cross section 
lowers the results for A by about 10%. 


large number of degrees of freedom, TV, m that the x 2 
should have a width of y/2N indicates that A could (68% 
DOB) be anywhere between 1.01 and 1.15. As in the 
previous subsection, choices of cr > 1 will increase this 
range of possibilities. 


D. Summary of expansion-parameter checks 

In any case, while none of these methods provide a 
crisp result for A^ from the 19 data points analyzed, 
it is reassuring that there is little evidence for a large 
change in A*,. Minimally, EKM’s estimate A^ « 600 MeV 
for their R = 0.9 fm calculation is consistent with these 
analyses, and the breakdown scale may in fact be a lit¬ 
tle higher. Further investigations employing these tech¬ 
niques with EFT coefficients drawn from many more ob¬ 
servables will provide more definitive answers. 


V. SUMMARY AND OUTLOOK 

We have adapted and extended the Bayesian frame¬ 
work originally introduced in the context of pQCD by 
Cacciari and Houdeau [6] to evaluate truncation errors in 
EFT expansions. Assumptions about the nature of the 
coefficients in the expansion are encoded as priors on the 
coefficients of higher-order terms in the EFT series. The 
pdfs for these coefficients then ultimately also include 
information on the distribution of coefficients at orders 
that are calculated. Here we employed priors derived 
from the notion of “naturalness” of EFT coefficients, i.e., 
the idea that they should be 0(1) when the observable 
and the momentum of the process in question are mea¬ 
sured in appropriate units. We took the coefficients in 
the EFT expansion of cross sections to be natural in this 
sense. Such a choice is uncontroversial for perturbative 
processes, e.g., meson-meson scattering at momenta well 
below the chiral-symmetry-breaking scale. It remains to 
be fully investigated for cross sections in nucleon-nucleon 
scattering, where the relationship between the underly¬ 
ing scales and observables is quite complex; we rely here 
on an empirical validation (see Fig. [5|. 

We investigated the influence of two prior pdfs for EFT 
coefficients on the truncation errors. The first was the 
CH characterization of an upper bound c, the second 
was a Gaussian of width c. We also investigated the in¬ 
fluence of priors on c itself on the results. We did this in 
the context of representative examples in Section |TT] and, 
in Section IH using results from the order-by-order cal¬ 
culations of neutron-proton cross sections by Epelbaum, 
Krebs, and MeiBner (EKM) in Ref. [TT] (obtained with a 
regulator parameter R = 0.9 fm). Combining the insights 
from both sections we find: 

• Priors that reflect a natural size for c give similar 
degree-of-belief (DOB) intervals at the lowest or¬ 
ders. 
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• The resulting error bands are tighter than those for 
which the scale of c is not constrained. 

• For higher orders, 68% DOB intervals show little 
dependence on prior choice; 95% DOB intervals 
have larger, but still quite small, dependence. 

These results have wide applicability to observables— 
they can be used in many EFT contexts. In the case 
of neutron-proton scattering our formulas provide a sta¬ 
tistical interpretation to error bars obtained by EKM in 
Ref. [10]. Their error bar is obtained in the case that the 
distribution of coefficients is uniform, in which case it is 
a j/(j + 1) * 100% DOB interval for the omitted terms 
in a hPLO calculation. But, as already stated, trunca¬ 
tion errors in these calculations at NLO or beyond (i.e., 
which include at least two orders beyond leading) were 
only mildly dependent on prior choice. In particular, the 
68% DOB intervals obtained in our Bayesian framework 
varied by at most 15% amongst all the priors considered 
here, and the variation was less than that in calculations 
beyond NLO. Error bands at a given order were also con¬ 
sistent with a statistical interpretation when compared 
with known higher-order results. Truncation errors at 
leading order are sensitive to prior choice, since-given 
the choice of scaling observable we made—almost no in¬ 
formation on the pattern of coefficients emerges from a 
leading-order calculation. Comparison of the resulting 
error band with the known results of NLO, N 2 LO, N 3 LO, 
and N 4 LO calculations suggests that the CH choice of 
a 0- function distribution for coefficients, and a scale- 
independent distribution for the width of that 0-function, 
is too conservative—at least for this case. Overall then, 
at sufficiently high order, the prior picked from Table [I] 
hardly matters—in practice k = 2 may be enough. At 
lower orders priors provide a rigorous way to explore dif¬ 
ferent assumptions about the pattern of coefficients in 
the EFT. 

Indeed, the application of Bayesian methods to data is 
often criticized because of the apparently subjective se¬ 
lection of prior pdfs. However, the priors manifest what 
would otherwise be implicit assumptions, so that they 
can be tested. The information encoded in those assump¬ 
tions is then modified in light of subsequent data: in this 
case the distribution of low-order coefficients influences 
the distributions computed for coefficients that enter the 
assessment of the truncation error. Furthermore, the de¬ 
velopment of specific pdfs for those higher-order coeffi¬ 
cients allows a statistical interpretation of the “theory 
error”—or at least the part of it that results from the 
truncation of the EFT series. This allows crisp answers 
to questions regarding, for example, how theory error 
bars should be combined, or the extent to which theory 
errors on different quantities are correlated. Those an¬ 
swers may have some sensitivity to the choice of prior on 
the higher-order coefficients, but the advantage of the 
Bayesian framework is that the consequences of prior 
assumptions about the distribution of coefficients (Uni¬ 
formly distributed or Gaussian? Natural or Scale-less?) 


can be traced through to the statistical uncertainties on 
the EFT calculation. Those assumptions can then—if 
necessary—be refined. 

Such refinement may be necessary in light of the need 
to identify an EFT breakdown scale before extracting 
the (supposedly) 0(1) coefficients which are input to our 
analysis. Mis-identification of the breakdown scale is one 
manner in which a particular prior could fail. But, in 
this case, we showed in Sec. |IV| that this breakdown scale 
A 5 = 600 MeV leads to success rates taken from the EFT 
predictions at four different energies, and for four differ¬ 
ent orders, that are statistically consistent with the DOB 
intervals resulting from our Bayesian formalism. Further¬ 
more, the distribution of coefficients with the R = 0.9 fm 
regulator choice is consistent with a Gaussian distribu¬ 
tion. Qualitatively a natural distribution is not seen for 
the coefficients obtained using a second, larger, value of 
R. The calculation at this larger regulator radius reflects 
cutoff artifacts, which leads to peculiar convergence of 
the EFT expansion. The breakdown of the EFT is then 
not set by A&, but by the effects of this softer cutoff. 
With the general formalism for probability distributions 
of EFT coefficients laid out here it will be important to 
check when the EFT coefficients obtained over a wide 
range of cutoff values and observables are empirically 
consistent with the application of naturalness priors to 
observables in the NN system. 

The Bayesian approach to error estimation presented 
here is an alternative to procedures that calculate error 
bands based on variation of the EFT regulator, which 
could be a cutoff in either momentum or coordinate 
space. While variation with regulator scale gives a lower 
bound on the uncertainty (theories should, after all, be 
regulator invariant up to higher-order terms) the result¬ 
ing error band has no statistical interpretation. A partic¬ 
ular flaw is the arbitrariness of the interval in which the 
cutoff is varied; for QCD this is only of mild concern, be¬ 
cause the dependence on the regulator parameter is only 
logarithmic. But running in the chiral EFT applied to 
NN scattering is much faster: it can contain high positive 
powers of the regulator (momentum) scale. This concern 
is exacerbated by the narrow range that is possible be¬ 
fore encountering irremediable cutoff artifacts or spurious 
deep bound states. It is also the case that residual cutoff 
dependence only reflects the contribution from omitted 
contact operators. These only enter the chiral expansion 
for NN observables at even orders, and so examination 
of cutoff dependence alone may substantially underesti¬ 
mate the EFT truncation error. More generally, when 
computed using only cutoff variation, the error bands for 
predictions of observables (as opposed to quantities used 
to fit EFT LECs) generically exhibit undesirable system- 
atics (e.g., sometimes growing wider with order) and of¬ 
ten underestimate the error when compared with actual 
higher-order calculations [4]. In contrast, the Bayesian 
assessment of truncation errors laid out here is appli¬ 
cable to all EFTs, admits a statistical interpretation of 
truncation errors, is justified when regulator parameters 
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cannot be varied widely, and predicts decreased errors at 
all orders—not just when new LECs are added. 

The truncation-error assessment described here is just 
one piece of a broader framework for EFT uncertainty 
quantification using Bayesian methods. We have under 
development analogous procedures, together with a suite 
of diagnostic tools, for parameter estimation and the as¬ 
sessment and propagation of errors—both statistical and 
truncation—in fitted LECs and predicted observables. 
Bayesian model selection is also well suited for address¬ 
ing fundamental questions in nuclear EFT, such as the 
comparative efficacy of theories with different degrees of 
freedom, from pionless to chiral EFTs with and without 
an explicit A(1232). 
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